An Efficient Method for Removing Deletion Errors in Quickly-spoken Connected Mandarin Digit String Speech Recognition
نویسندگان
چکیده
Connected Mandarin digit string speech, especially at rapid spoken rate, is very difficult to recognize correctly. In this paper, a new training method named neighboring digits pattern is proposed in order to eliminate most of deletion errors which frequently occur in Mandarin digits speech recognition at high speaking rate when we have enough quickly-spoken speech data as the training set. The complete implementation process and the corresponding data analysis are presented detailed and the performance is compared with that of the conventional system through the experiments. The results of comparison explain that the new method can reduce the deletion errors effectively, and thus improves the system recognition rate from 96.4% to 98.3%.
منابع مشابه
Neighboring Digits Pattern Training Method in Quickly-spoken Connected Mandarin Digits Speech Recognition
Deletion errors are most usually occurred in connected Mandarin digit string speech recognition when speaking rate is fast, and are the main reasons leading to the increasing of the recognition error rate and the decline of the recognition accuracy. In this paper, a new training method named neighboring digits pattern is given based on sufficient statistics of recognition errors of the traditio...
متن کاملDuration Modeling in Mandarin Connected Digit Recognition
Digit string recognition is required in many applications which need to recognize numbers such as telephone numbers, credit card numbers, date, etc. In order to design a high performance recognizer, duration information is explored in this study. In a Mandarin connected digit recognizer, insertion and deletion errors amount to more than two thirds of the total recognition errors because there e...
متن کاملImprovement in Connected Mandarin Digit Recognition by Explicitly Modeling Coarticulatory Information
The most successful training scheme for recognition of connected spoken digits is the segmental k-means algorithm, which implicitly captures the coarticulatory information of connected speech iteratively to establish reliable reference patterns. However, when this algorithm is applied to Mandarin digits, the obtained performance is inferior to that of English. Hence, a novel approach is propose...
متن کاملPerformance of Mandarin Connected Digit Recognizer with Word Duration Modeling
Digit string recognition is required in many applications such as automatic banking system, database information retrieving system, etc. In order to design a high performance recognizer, duration information is explored in this study. In a Mandarin connected digit recognizer, insertion and deletion errors amount to more than two thirds of the total recognition errors because there exist two mon...
متن کاملAn embedded word training procedure for connected digit recognition
The "conventional" way of obtaining word reference patterns for connected word recognition systems is to use isolatàd word patterns, and to rely on the dynamics of the matching algorithm to account for the differences in connected speech. Connected word recognition, based on such an approach, tends to become unreliable (high error rates) when the talking rate becomes grossly incommensurate with...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010